大規模並列プロセッサのプログラミング：実践的なアプローチ：汎用GPUアーキテクチャへの進化

次のアーキテクチャから NVIDIA GT200 へ移行する際、 Fermiアーキテクチャは GPU計算の第3世代の誕生を意味しています前世代のアーキテクチャが数学向けにグラフィックス処理ユニットを改造したものだったのに対し、Fermiは GPGPU（汎用GPU） アプリケーションのために完全に設計されたものです。

1. グラフィックス中心から計算中心へ

GT200のようにテクスチャユニットと厳密なデータ並列性に焦点を当てていたのに対し、Fermiは統合されたメモリリクエストパスを導入しました。この変化により 計算的思考開発者は単純な2次元グリッドマッピングから離れ、複雑なC++アルゴリズムの実装へと進むことが可能になりました。

2. メモリ階層の飛躍

Fermiは本格的な L1/L2キャッシュ階層 および IEEE 754-2008 浮動小数点標準への準拠を導入しました。これにより、研究者は各バイトごとに「スクラッチパッド」メモリ（共有メモリ）を手動で管理する必要がなくなり、不規則なデータ構造や科学的工学に適した倍精度の正確性を実現できるようになりました。

TERMINALbash — 80x24

> Ready. Click "Run" to execute.

QUESTION 1

Which architecture is considered the true start of the 'Third Generation' of GPU computing?

GT200 (Tesla)

Fermi

G80

Fixed-function Pipeline

QUESTION 2

What memory feature was introduced in Fermi to help handle irregular data patterns?

Manual Scratchpad only

Hardware-managed L1/L2 Cache Hierarchy

Write-only Texture Buffers

Disabling Global Memory

QUESTION 3

Fermi's compliance with IEEE 754-2008 was critical for which application type?

Simple 2D Sprite Rendering

High-precision Scientific Computing (FP64)

Text Scrolling

Basic Vertex Shading

QUESTION 4

What does 'Computational Thinking' refer to in the context of the Fermi shift?

Treating the GPU as a fixed-function signal processor.

Focusing on the physics of the problem rather than manual data movement.

Manually coding assembly for every pixel.

Using only 2D textures for storage.

QUESTION 5

How did Fermi improve thread management?

It removed the concept of Warps.

It introduced sophisticated hardware thread scheduling.

It limited threads to only 32 per GPU.

It forced all threads to run the same instruction forever.